Collective Data Mining: A New Perspective Toward Distributed Data Mining

نویسندگان

  • Hillol Kargupta
  • Byung-Hoon Park
  • Daryl Hershberger
  • Erik Johnson
  • Philip Chan
چکیده

This paper introduces the collective data mining (CDM), a new approach toward distributed data mining (DDM) from heterogeneous sites. It points out that naive approaches to distributed data analysis in a heterogeneous environment may face ambiguous situation and may lead to incorrect global data model. It also observes that any function can be expressed in a distributed fashion using a set of appropriate basis functions and orthonormal basis functions can be eeectively used for developing a general framework for DDM that guarantees correct local analysis, resulting in desired global data model using minimal data communication. The paper develops the foundation of CDM, discusses decision tree learning and polynomial regression in CDM for discrete and continuous variables, and describes the BODHI, a CDM based experimental system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integration and Interaction of Distributed Data Mining with Agent Technology

In recent years, more and more researchers have been involved in research on both agent technology and distributed data mining. A clear disciplinary effort has been activated toward removing the boundary between them, that is the interaction and integration between agent technology and distributed data mining. We refer this to agent mining as a new area. The marriage of agents and distributed d...

متن کامل

Distributed Data Mining and Agent Mining Interaction and Integration: a Novel Approach

In recent years, more and more researchers have been involved in research on both agent technology and distributed data mining. A clear disciplinary effort has been activated toward removing the boundary between them, that is the interaction and integration between agent technology and distributed data mining. We refer this to agent mining as a new area. The marriage of agents and distributed d...

متن کامل

Interaction and Integration of Agent Mining in Distributed Data Environment

In recent years, more and more researchers have been involved in research on both agent technology and distributed data mining. A clear disciplinary effort has been activated toward removing the boundary between them,that is the interaction and integration between agent technology and distributed data mining. We refer this to agent mining as a new area. The marriage of agents and distributed da...

متن کامل

Clustered Collaborative Filtering Approach for Distributed Data Mining on Electronic Health Records

Distributed Data Mining (DDM) has become one of the promising areas of Data Mining. DDM techniques include classifier approach and agent-approach. Classifier approach plays a vital role in mining distributed data, having homogeneous and heterogeneous approaches depend on data sites. Homogeneous classifier approach involves ensemble learning, distributed association rule mining, meta-learning an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999